Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 68205 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.8 MiB |
| Average record size in memory | 136.0 B |
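The memory figures above fit together if every cell takes 8 bytes. That width is an assumption, since the report does not state how values are stored: 17 variables would then give 136 B per record, and 68205 such records come to roughly 8.8 MiB. A quick arithmetic check:

```python
N_ROWS = 68205        # number of observations (from the table above)
N_COLS = 17           # number of variables (from the table above)
BYTES_PER_VALUE = 8   # assumed storage width per cell; not stated in the report

record_bytes = N_COLS * BYTES_PER_VALUE
total_mib = N_ROWS * record_bytes / 1024 ** 2

print(f"{record_bytes} B per record, {total_mib:.1f} MiB in total")
```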
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 9 |
Alerts
| Alert | Type |
|---|---|
| age is highly overall correlated with age_years | High correlation |
| weight is highly overall correlated with bmi | High correlation |
| ap_hi is highly overall correlated with ap_lo and 2 other fields | High correlation |
| ap_lo is highly overall correlated with ap_hi and 2 other fields | High correlation |
| age_years is highly overall correlated with age | High correlation |
| bmi is highly overall correlated with weight | High correlation |
| bp_category is highly overall correlated with ap_hi and 2 other fields | High correlation |
| bp_category_encoded is highly overall correlated with ap_hi and 2 other fields | High correlation |
| gluc is highly imbalanced (52.2%) | Imbalance |
| smoke is highly imbalanced (57.1%) | Imbalance |
| alco is highly imbalanced (70.0%) | Imbalance |
| id is uniformly distributed | Uniform |
| id has unique values | Unique |
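The imbalance percentages reported for gluc, smoke and alco are consistent with one minus the normalized Shannon entropy of the category counts. The sketch below recomputes them from the counts listed in the per-variable sections of this report. The function name `imbalance_score` is illustrative, and the formula is inferred from the numbers rather than taken from the tool's documentation.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for perfectly balanced classes, close to 1 when one class dominates."""
    k = len(counts)
    if k < 2:
        raise ValueError("need at least two categories")
    total = sum(counts)
    # Shannon entropy of the empirical class distribution (natural log)
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Category counts copied from the Common Values tables of this report
gluc = imbalance_score([58027, 5180, 4998])
smoke = imbalance_score([62226, 5979])
alco = imbalance_score([64581, 3624])

print(f"gluc {gluc:.1%}, smoke {smoke:.1%}, alco {alco:.1%}")
# prints: gluc 52.2%, smoke 57.1%, alco 70.0%
```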
Reproduction
| Analysis started | 2023-11-01 17:42:52.377134 |
|---|---|
| Analysis finished | 2023-11-01 17:43:22.512755 |
| Duration | 30.14 seconds |
| Software version | ydata-profiling v4.6.0 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 68205 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49972.41 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4952.2 |
| Q1 | 24991 |
| median | 50008 |
| Q3 | 74878 |
| 95-th percentile | 94933.4 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 49887 |
Descriptive statistics
| Standard deviation | 28852.138 |
|---|---|
| Coefficient of variation (CV) | 0.57736135 |
| Kurtosis | -1.1983614 |
| Mean | 49972.41 |
| Median Absolute Deviation (MAD) | 24949 |
| Skewness | -0.0014792948 |
| Sum | 3.4083683 × 10⁹ |
| Variance | 8.3244588 × 10⁸ |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 66648 | 1 | < 0.1% |
| 66624 | 1 | < 0.1% |
| 66625 | 1 | < 0.1% |
| 66626 | 1 | < 0.1% |
| 66628 | 1 | < 0.1% |
| 66630 | 1 | < 0.1% |
| 66631 | 1 | < 0.1% |
| 66632 | 1 | < 0.1% |
| 66633 | 1 | < 0.1% |
| Other values (68195) | 68195 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 |
| Value | Count | Frequency (%) |
| 99999 | 1 | |
| 99998 | 1 | |
| 99996 | 1 | |
| 99995 | 1 | |
| 99993 | 1 | |
| 99992 | 1 | |
| 99991 | 1 | |
| 99990 | 1 | |
| 99988 | 1 | |
| 99986 | 1 |
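Several of the `id` statistics above are derived from the others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A minimal check using the reported (rounded) values, so small differences in the last digits are expected:

```python
# Values copied from the id tables above
minimum, maximum = 0, 99999
q1, q3 = 24991, 74878
mean, std = 49972.41, 28852.138

value_range = maximum - minimum   # reported Range: 99999
iqr = q3 - q1                     # reported IQR: 49887
cv = std / mean                   # reported CV: 0.57736135
variance = std ** 2               # reported Variance: about 8.3244588e8

print(value_range, iqr, round(cv, 5))
```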
age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8061 |
|---|---|
| Distinct (%) | 11.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19462.668 |
| Minimum | 10798 |
|---|---|
| Maximum | 23713 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 10798 |
|---|---|
| 5-th percentile | 15054.2 |
| Q1 | 17656 |
| median | 19700 |
| Q3 | 21323 |
| 95-th percentile | 23256 |
| Maximum | 23713 |
| Range | 12915 |
| Interquartile range (IQR) | 3667 |
Descriptive statistics
| Standard deviation | 2468.3819 |
|---|---|
| Coefficient of variation (CV) | 0.12682649 |
| Kurtosis | -0.82605314 |
| Mean | 19462.668 |
| Median Absolute Deviation (MAD) | 1713 |
| Skewness | -0.3048272 |
| Sum | 1.3274513 × 10⁹ |
| Variance | 6092909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19741 | 32 | < 0.1% |
| 18253 | 31 | < 0.1% |
| 21892 | 30 | < 0.1% |
| 18236 | 30 | < 0.1% |
| 18184 | 30 | < 0.1% |
| 20442 | 29 | < 0.1% |
| 19733 | 29 | < 0.1% |
| 20376 | 29 | < 0.1% |
| 20389 | 29 | < 0.1% |
| 21159 | 28 | < 0.1% |
| Other values (8051) | 67908 |
| Value | Count | Frequency (%) |
| 10798 | 1 | < 0.1% |
| 10859 | 1 | < 0.1% |
| 10878 | 1 | < 0.1% |
| 10964 | 1 | < 0.1% |
| 14275 | 1 | < 0.1% |
| 14277 | 1 | < 0.1% |
| 14282 | 1 | < 0.1% |
| 14284 | 1 | < 0.1% |
| 14287 | 1 | < 0.1% |
| 14291 | 3 |
| Value | Count | Frequency (%) |
| 23713 | 1 | |
| 23701 | 1 | |
| 23692 | 1 | |
| 23690 | 1 | |
| 23687 | 1 | |
| 23684 | 1 | |
| 23678 | 1 | |
| 23677 | 1 | |
| 23675 | 2 | |
| 23673 | 2 |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 44427 | |
| 2 | 23778 |
Words
| Value | Count | Frequency (%) |
| 1 | 44427 | |
| 2 | 23778 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 44427 | |
| 2 | 23778 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 44427 | |
| 2 | 23778 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 44427 | |
| 2 | 23778 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 44427 | |
| 2 | 23778 |
height
Real number (ℝ)
| Distinct | 106 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 164.37286 |
| Minimum | 55 |
|---|---|
| Maximum | 250 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 55 |
|---|---|
| 5-th percentile | 152 |
| Q1 | 159 |
| median | 165 |
| Q3 | 170 |
| 95-th percentile | 178 |
| Maximum | 250 |
| Range | 195 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.1767564 |
|---|---|
| Coefficient of variation (CV) | 0.049745173 |
| Kurtosis | 7.6469704 |
| Mean | 164.37286 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.61186845 |
| Sum | 11211051 |
| Variance | 66.859345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 165 | 5728 | 8.4% |
| 160 | 4890 | 7.2% |
| 170 | 4584 | 6.7% |
| 168 | 4307 | 6.3% |
| 164 | 3323 | 4.9% |
| 158 | 3236 | 4.7% |
| 162 | 3180 | 4.7% |
| 169 | 2741 | 4.0% |
| 156 | 2681 | 3.9% |
| 167 | 2486 | 3.6% |
| Other values (96) | 31049 |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 65 | 2 | |
| 67 | 3 | |
| 68 | 2 | |
| 70 | 2 | |
| 71 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 250 | 1 | < 0.1% |
| 207 | 1 | < 0.1% |
| 198 | 14 | |
| 197 | 4 | < 0.1% |
| 196 | 6 | |
| 195 | 6 | |
| 194 | 2 | < 0.1% |
| 193 | 6 | |
| 192 | 12 | |
| 191 | 11 |
weight
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 278 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.100688 |
| Minimum | 11 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 55 |
| Q1 | 65 |
| median | 72 |
| Q3 | 82 |
| 95-th percentile | 100 |
| Maximum | 200 |
| Range | 189 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 14.288862 |
|---|---|
| Coefficient of variation (CV) | 0.19283036 |
| Kurtosis | 2.5571391 |
| Mean | 74.100688 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.0058102 |
| Sum | 5054037.4 |
| Variance | 204.17158 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 3779 | 5.5% |
| 70 | 3692 | 5.4% |
| 68 | 2767 | 4.1% |
| 75 | 2675 | 3.9% |
| 60 | 2670 | 3.9% |
| 80 | 2569 | 3.8% |
| 72 | 2249 | 3.3% |
| 69 | 2152 | 3.2% |
| 78 | 2035 | 3.0% |
| 74 | 1827 | 2.7% |
| Other values (268) | 41790 |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 30 | 3 | |
| 31 | 1 | < 0.1% |
| 32 | 3 | |
| 33 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 200 | 2 | |
| 183 | 1 | < 0.1% |
| 180 | 4 | |
| 178 | 3 | |
| 177 | 1 | < 0.1% |
| 172 | 1 | < 0.1% |
| 171 | 1 | < 0.1% |
| 170 | 3 | |
| 169 | 1 | < 0.1% |
| 168 | 3 |
ap_hi
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 86 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.43492 |
| Minimum | 90 |
|---|---|
| Maximum | 180 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 90 |
|---|---|
| 5-th percentile | 100 |
| Q1 | 120 |
| median | 120 |
| Q3 | 140 |
| 95-th percentile | 160 |
| Maximum | 180 |
| Range | 90 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 15.961685 |
|---|---|
| Coefficient of variation (CV) | 0.12624427 |
| Kurtosis | 0.76049898 |
| Mean | 126.43492 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.73995679 |
| Sum | 8623494 |
| Variance | 254.77539 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 120 | 27654 | |
| 140 | 9324 | 13.7% |
| 130 | 8907 | 13.1% |
| 110 | 8617 | 12.6% |
| 150 | 4197 | 6.2% |
| 160 | 2792 | 4.1% |
| 100 | 2563 | 3.8% |
| 90 | 928 | 1.4% |
| 170 | 647 | 0.9% |
| 180 | 602 | 0.9% |
| Other values (76) | 1974 | 2.9% |
| Value | Count | Frequency (%) |
| 90 | 928 | 1.4% |
| 93 | 1 | < 0.1% |
| 95 | 28 | < 0.1% |
| 96 | 2 | < 0.1% |
| 99 | 4 | < 0.1% |
| 100 | 2563 | |
| 101 | 4 | < 0.1% |
| 102 | 8 | < 0.1% |
| 103 | 9 | < 0.1% |
| 104 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 180 | 602 | |
| 179 | 4 | < 0.1% |
| 178 | 2 | < 0.1% |
| 177 | 2 | < 0.1% |
| 176 | 3 | < 0.1% |
| 175 | 14 | < 0.1% |
| 174 | 3 | < 0.1% |
| 173 | 2 | < 0.1% |
| 172 | 8 | < 0.1% |
| 171 | 8 | < 0.1% |
ap_lo
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.263925 |
| Minimum | 60 |
|---|---|
| Maximum | 120 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 70 |
| Q1 | 80 |
| median | 80 |
| Q3 | 90 |
| 95-th percentile | 100 |
| Maximum | 120 |
| Range | 60 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.1439848 |
|---|---|
| Coefficient of variation (CV) | 0.11252207 |
| Kurtosis | 0.93192798 |
| Mean | 81.263925 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.23882217 |
| Sum | 5542606 |
| Variance | 83.612458 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 34725 | |
| 90 | 14239 | |
| 70 | 10212 | 15.0% |
| 100 | 3978 | 5.8% |
| 60 | 2656 | 3.9% |
| 79 | 357 | 0.5% |
| 110 | 338 | 0.5% |
| 85 | 290 | 0.4% |
| 75 | 209 | 0.3% |
| 95 | 158 | 0.2% |
| Other values (48) | 1043 | 1.5% |
| Value | Count | Frequency (%) |
| 60 | 2656 | |
| 61 | 6 | < 0.1% |
| 62 | 7 | < 0.1% |
| 63 | 7 | < 0.1% |
| 64 | 10 | < 0.1% |
| 65 | 78 | 0.1% |
| 66 | 11 | < 0.1% |
| 67 | 19 | < 0.1% |
| 68 | 13 | < 0.1% |
| 69 | 98 | 0.1% |
| Value | Count | Frequency (%) |
| 120 | 134 | 0.2% |
| 119 | 2 | < 0.1% |
| 115 | 7 | < 0.1% |
| 114 | 1 | < 0.1% |
| 113 | 3 | < 0.1% |
| 112 | 1 | < 0.1% |
| 111 | 1 | < 0.1% |
| 110 | 338 | |
| 109 | 6 | < 0.1% |
| 108 | 3 | < 0.1% |
cholesterol
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 51222 | |
| 2 | 9191 | 13.5% |
| 3 | 7792 | 11.4% |
Words
| Value | Count | Frequency (%) |
| 1 | 51222 | |
| 2 | 9191 | 13.5% |
| 3 | 7792 | 11.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 51222 | |
| 2 | 9191 | 13.5% |
| 3 | 7792 | 11.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 51222 | |
| 2 | 9191 | 13.5% |
| 3 | 7792 | 11.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 51222 | |
| 2 | 9191 | 13.5% |
| 3 | 7792 | 11.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 51222 | |
| 2 | 9191 | 13.5% |
| 3 | 7792 | 11.4% |
gluc
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 58027 | |
| 3 | 5180 | 7.6% |
| 2 | 4998 | 7.3% |
Words
| Value | Count | Frequency (%) |
| 1 | 58027 | |
| 3 | 5180 | 7.6% |
| 2 | 4998 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 58027 | |
| 3 | 5180 | 7.6% |
| 2 | 4998 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 58027 | |
| 3 | 5180 | 7.6% |
| 2 | 4998 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 58027 | |
| 3 | 5180 | 7.6% |
| 2 | 4998 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 58027 | |
| 3 | 5180 | 7.6% |
| 2 | 4998 | 7.3% |
smoke
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 62226 | |
| 1 | 5979 | 8.8% |
Words
| Value | Count | Frequency (%) |
| 0 | 62226 | |
| 1 | 5979 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 62226 | |
| 1 | 5979 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 62226 | |
| 1 | 5979 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 62226 | |
| 1 | 5979 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 62226 | |
| 1 | 5979 | 8.8% |
alco
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 64581 | |
| 1 | 3624 | 5.3% |
Words
| Value | Count | Frequency (%) |
| 0 | 64581 | |
| 1 | 3624 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 64581 | |
| 1 | 3624 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 64581 | |
| 1 | 3624 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 64581 | |
| 1 | 3624 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 64581 | |
| 1 | 3624 | 5.3% |
active
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 54806 | |
| 0 | 13399 | 19.6% |
Words
| Value | Count | Frequency (%) |
| 1 | 54806 | |
| 0 | 13399 | 19.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 54806 | |
| 0 | 13399 | 19.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 54806 | |
| 0 | 13399 | 19.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 54806 | |
| 0 | 13399 | 19.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 54806 | |
| 0 | 13399 | 19.6% |
cardio
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 68205 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 34533 | |
| 1 | 33672 |
Words
| Value | Count | Frequency (%) |
| 0 | 34533 | |
| 1 | 33672 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 34533 | |
| 1 | 33672 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 34533 | |
| 1 | 33672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 34533 | |
| 1 | 33672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 34533 | |
| 1 | 33672 |
age_years
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.823635 |
| Minimum | 29 |
|---|---|
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 29 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 48 |
| median | 53 |
| Q3 | 58 |
| 95-th percentile | 63 |
| Maximum | 64 |
| Range | 35 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.7699095 |
|---|---|
| Coefficient of variation (CV) | 0.12816061 |
| Kurtosis | -0.82138264 |
| Mean | 52.823635 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.30356736 |
| Sum | 3602836 |
| Variance | 45.831674 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55 | 3825 | 5.6% |
| 53 | 3751 | 5.5% |
| 57 | 3568 | 5.2% |
| 54 | 3530 | 5.2% |
| 56 | 3507 | 5.1% |
| 59 | 3484 | 5.1% |
| 49 | 3336 | 4.9% |
| 58 | 3312 | 4.9% |
| 51 | 3274 | 4.8% |
| 52 | 3194 | 4.7% |
| Other values (18) | 33424 |
| Value | Count | Frequency (%) |
| 29 | 3 | < 0.1% |
| 30 | 1 | < 0.1% |
| 39 | 1749 | |
| 40 | 1591 | |
| 41 | 1855 | |
| 42 | 1390 | |
| 43 | 1981 | |
| 44 | 1475 | |
| 45 | 2039 | |
| 46 | 1594 |
| Value | Count | Frequency (%) |
| 64 | 2122 | |
| 63 | 2652 | |
| 62 | 2135 | |
| 61 | 2647 | |
| 60 | 3127 | |
| 59 | 3484 | |
| 58 | 3312 | |
| 57 | 3568 | |
| 56 | 3507 | |
| 55 | 3825 |
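`age` appears to be recorded in days and `age_years` in completed years, which would explain the near-perfect correlation between the two. The sketch below assumes a floor division by 365. That divisor is an assumption (the profile does not state how `age_years` was derived), but it reproduces the reported minimum and maximum as well as the three sample rows at the end of this report.

```python
DAYS_PER_YEAR = 365  # assumed divisor; not stated in the profile

def age_in_years(age_days: int) -> int:
    """Convert an age in days to completed years using floor division."""
    return age_days // DAYS_PER_YEAR

# (age in days, reported age_years) pairs taken from this profile:
# the two minima, the two maxima, and the three sample rows
checks = [(10798, 29), (23713, 64), (18393, 50), (20228, 55), (18857, 51)]
mismatches = [(d, y) for d, y in checks if age_in_years(d) != y]
print(mismatches)  # an empty list means every pair is reproduced
```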
bmi
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3752 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.510513 |
| Minimum | 3.4717839 |
|---|---|
| Maximum | 298.66667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 533.0 KiB |
Quantile statistics
| Minimum | 3.4717839 |
|---|---|
| 5-th percentile | 20.936639 |
| Q1 | 23.875115 |
| median | 26.346494 |
| Q3 | 30.116213 |
| 95-th percentile | 37.253489 |
| Maximum | 298.66667 |
| Range | 295.19488 |
| Interquartile range (IQR) | 6.2410984 |
Descriptive statistics
| Standard deviation | 6.026497 |
|---|---|
| Coefficient of variation (CV) | 0.2190616 |
| Kurtosis | 230.5905 |
| Mean | 27.510513 |
| Median Absolute Deviation (MAD) | 2.9229366 |
| Skewness | 7.8187178 |
| Sum | 1876354.6 |
| Variance | 36.318667 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.87511478 | 930 | 1.4% |
| 23.4375 | 641 | 0.9% |
| 24.22145329 | 485 | 0.7% |
| 25.71166208 | 359 | 0.5% |
| 22.03856749 | 354 | 0.5% |
| 23.03004535 | 342 | 0.5% |
| 24.8015873 | 325 | 0.5% |
| 23.52941176 | 313 | 0.5% |
| 24.97704316 | 284 | 0.4% |
| 25.390625 | 279 | 0.4% |
| Other values (3742) | 63893 |
| Value | Count | Frequency (%) |
| 3.471783866 | 1 | |
| 7.022247758 | 1 | |
| 8.001828989 | 1 | |
| 9.331007343 | 1 | |
| 9.917581478 | 1 | |
| 10.7266436 | 1 | |
| 11.71875 | 1 | |
| 12.25447288 | 1 | |
| 12.85583104 | 1 | |
| 13.49300051 | 1 |
| Value | Count | Frequency (%) |
| 298.6666667 | 1 | |
| 278.125 | 1 | |
| 267.768595 | 1 | |
| 237.7686328 | 1 | |
| 191.6666667 | 1 | |
| 187.7500769 | 1 | |
| 180.6780742 | 1 | |
| 178.9627465 | 1 | |
| 178.2134106 | 1 | |
| 170.4142012 | 1 |
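The `bmi` values are consistent with the usual body-mass-index formula, weight in kilograms divided by the square of height in metres. This assumes `height` is in centimetres and `weight` in kilograms, which the profile does not state explicitly. A minimal sketch, checked against the three sample rows at the end of this report:

```python
def bmi(weight_kg: float, height_cm: float) -> float:
    """Body-mass index: weight (kg) divided by height (m) squared."""
    height_m = height_cm / 100.0
    return weight_kg / height_m ** 2

# (weight, height, reported bmi) from the sample rows of this report
samples = [(62.0, 168, 21.967120), (85.0, 156, 34.927679), (64.0, 165, 23.507805)]
max_error = max(abs(bmi(w, h) - reported) for w, h, reported in samples)
print(f"largest deviation from the reported values: {max_error:.1e}")
```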
bp_category
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 17.521443 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1195050 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hypertension Stage 1 |
|---|---|
| 2nd row | Hypertension Stage 2 |
| 3rd row | Hypertension Stage 1 |
| 4th row | Hypertension Stage 2 |
| 5th row | Normal |
Common Values
| Value | Count | Frequency (%) |
| Hypertension Stage 1 | 39750 | |
| Hypertension Stage 2 | 15937 | |
| Normal | 9417 | 13.8% |
| Elevated | 3101 | 4.5% |
Words
| Value | Count | Frequency (%) |
| hypertension | 55687 | |
| stage | 55687 | |
| 1 | 39750 | |
| 2 | 15937 | 8.9% |
| normal | 9417 | 5.2% |
| elevated | 3101 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | 9.6% |
| (space) | 111374 | 9.3% |
| n | 111374 | 9.3% |
| a | 68205 | 5.7% |
| r | 65104 | 5.4% |
| o | 65104 | 5.4% |
| H | 55687 | 4.7% |
| g | 55687 | 4.7% |
| y | 55687 | 4.7% |
| Other values (12) | 319090 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 904097 | |
| Uppercase Letter | 123892 | 10.4% |
| Space Separator | 111374 | 9.3% |
| Decimal Number | 55687 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | |
| n | 111374 | |
| a | 68205 | 7.5% |
| r | 65104 | 7.2% |
| o | 65104 | 7.2% |
| g | 55687 | 6.2% |
| y | 55687 | 6.2% |
| i | 55687 | 6.2% |
| s | 55687 | 6.2% |
| Other values (5) | 83824 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 55687 | |
| S | 55687 | |
| N | 9417 | 7.6% |
| E | 3101 | 2.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39750 | |
| 2 | 15937 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 111374 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1027989 | |
| Common | 167061 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | |
| n | 111374 | |
| a | 68205 | 6.6% |
| r | 65104 | 6.3% |
| o | 65104 | 6.3% |
| H | 55687 | 5.4% |
| g | 55687 | 5.4% |
| y | 55687 | 5.4% |
| S | 55687 | 5.4% |
| Other values (9) | 207716 |
Common
| Value | Count | Frequency (%) |
| (space) | 111374 | |
| 1 | 39750 | 23.8% |
| 2 | 15937 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1195050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | 9.6% |
| (space) | 111374 | 9.3% |
| n | 111374 | 9.3% |
| a | 68205 | 5.7% |
| r | 65104 | 5.4% |
| o | 65104 | 5.4% |
| H | 55687 | 4.7% |
| g | 55687 | 4.7% |
| y | 55687 | 4.7% |
| Other values (12) | 319090 |
bp_category_encoded
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 533.0 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 17.521443 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1195050 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hypertension Stage 1 |
|---|---|
| 2nd row | Hypertension Stage 2 |
| 3rd row | Hypertension Stage 1 |
| 4th row | Hypertension Stage 2 |
| 5th row | Normal |
Common Values
| Value | Count | Frequency (%) |
| Hypertension Stage 1 | 39750 | |
| Hypertension Stage 2 | 15937 | |
| Normal | 9417 | 13.8% |
| Elevated | 3101 | 4.5% |
Words
| Value | Count | Frequency (%) |
| hypertension | 55687 | |
| stage | 55687 | |
| 1 | 39750 | |
| 2 | 15937 | 8.9% |
| normal | 9417 | 5.2% |
| elevated | 3101 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | 9.6% |
| (space) | 111374 | 9.3% |
| n | 111374 | 9.3% |
| a | 68205 | 5.7% |
| r | 65104 | 5.4% |
| o | 65104 | 5.4% |
| H | 55687 | 4.7% |
| g | 55687 | 4.7% |
| y | 55687 | 4.7% |
| Other values (12) | 319090 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 904097 | |
| Uppercase Letter | 123892 | 10.4% |
| Space Separator | 111374 | 9.3% |
| Decimal Number | 55687 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | |
| n | 111374 | |
| a | 68205 | 7.5% |
| r | 65104 | 7.2% |
| o | 65104 | 7.2% |
| g | 55687 | 6.2% |
| y | 55687 | 6.2% |
| i | 55687 | 6.2% |
| s | 55687 | 6.2% |
| Other values (5) | 83824 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 55687 | |
| S | 55687 | |
| N | 9417 | 7.6% |
| E | 3101 | 2.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39750 | |
| 2 | 15937 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 111374 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1027989 | |
| Common | 167061 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | |
| n | 111374 | |
| a | 68205 | 6.6% |
| r | 65104 | 6.3% |
| o | 65104 | 6.3% |
| H | 55687 | 5.4% |
| g | 55687 | 5.4% |
| y | 55687 | 5.4% |
| S | 55687 | 5.4% |
| Other values (9) | 207716 |
Common
| Value | Count | Frequency (%) |
| (space) | 111374 | |
| 1 | 39750 | 23.8% |
| 2 | 15937 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1195050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 173263 | |
| t | 114475 | 9.6% |
| (space) | 111374 | 9.3% |
| n | 111374 | 9.3% |
| a | 68205 | 5.7% |
| r | 65104 | 5.4% |
| o | 65104 | 5.4% |
| H | 55687 | 4.7% |
| g | 55687 | 4.7% |
| y | 55687 | 4.7% |
| Other values (12) | 319090 |
| | id | age | height | weight | ap_hi | ap_lo | age_years | bmi | gender | cholesterol | gluc | smoke | alco | active | cardio | bp_category | bp_category_encoded |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 1.000 | 0.003 | -0.002 | -0.002 | 0.003 | -0.001 | 0.003 | -0.001 | 0.013 | 0.006 | 0.000 | 0.004 | 0.000 | 0.006 | 0.006 | 0.006 | 0.006 |
| age | 0.003 | 1.000 | -0.082 | 0.061 | 0.222 | 0.157 | 0.999 | 0.107 | 0.052 | 0.113 | 0.071 | 0.048 | 0.028 | 0.014 | 0.240 | 0.112 | 0.112 |
| height | -0.002 | -0.082 | 1.000 | 0.314 | 0.021 | 0.031 | -0.084 | -0.183 | 0.415 | 0.031 | 0.012 | 0.169 | 0.089 | 0.014 | 0.017 | 0.037 | 0.037 |
| weight | -0.002 | 0.061 | 0.314 | 1.000 | 0.276 | 0.249 | 0.063 | 0.848 | 0.169 | 0.096 | 0.082 | 0.068 | 0.064 | 0.017 | 0.165 | 0.135 | 0.135 |
| ap_hi | 0.003 | 0.222 | 0.021 | 0.276 | 1.000 | 0.741 | 0.223 | 0.278 | 0.087 | 0.174 | 0.089 | 0.031 | 0.038 | 0.021 | 0.463 | 0.678 | 0.678 |
| ap_lo | -0.001 | 0.157 | 0.031 | 0.249 | 0.741 | 1.000 | 0.158 | 0.244 | 0.072 | 0.131 | 0.065 | 0.025 | 0.045 | 0.009 | 0.365 | 0.723 | 0.723 |
| age_years | 0.003 | 0.999 | -0.084 | 0.063 | 0.223 | 0.158 | 1.000 | 0.110 | 0.051 | 0.112 | 0.070 | 0.048 | 0.029 | 0.014 | 0.240 | 0.112 | 0.112 |
| bmi | -0.001 | 0.107 | -0.183 | 0.848 | 0.278 | 0.244 | 0.110 | 1.000 | 0.115 | 0.093 | 0.072 | 0.031 | 0.000 | 0.020 | 0.131 | 0.086 | 0.086 |
| gender | 0.013 | 0.052 | 0.415 | 0.169 | 0.087 | 0.072 | 0.051 | 0.115 | 1.000 | 0.037 | 0.021 | 0.338 | 0.171 | 0.003 | 0.005 | 0.080 | 0.080 |
| cholesterol | 0.006 | 0.113 | 0.031 | 0.096 | 0.174 | 0.131 | 0.112 | 0.093 | 0.037 | 1.000 | 0.393 | 0.024 | 0.043 | 0.012 | 0.221 | 0.122 | 0.122 |
| gluc | 0.000 | 0.071 | 0.012 | 0.082 | 0.089 | 0.065 | 0.070 | 0.072 | 0.021 | 0.393 | 1.000 | 0.019 | 0.029 | 0.011 | 0.091 | 0.063 | 0.063 |
| smoke | 0.004 | 0.048 | 0.169 | 0.068 | 0.031 | 0.025 | 0.048 | 0.031 | 0.338 | 0.024 | 0.019 | 1.000 | 0.338 | 0.025 | 0.016 | 0.020 | 0.020 |
| alco | 0.000 | 0.028 | 0.089 | 0.064 | 0.038 | 0.045 | 0.029 | 0.000 | 0.171 | 0.043 | 0.029 | 0.338 | 1.000 | 0.024 | 0.008 | 0.030 | 0.030 |
| active | 0.006 | 0.014 | 0.014 | 0.017 | 0.021 | 0.009 | 0.014 | 0.020 | 0.003 | 0.012 | 0.011 | 0.025 | 0.024 | 1.000 | 0.038 | 0.014 | 0.014 |
| cardio | 0.006 | 0.240 | 0.017 | 0.165 | 0.463 | 0.365 | 0.240 | 0.131 | 0.005 | 0.221 | 0.091 | 0.016 | 0.008 | 0.038 | 1.000 | 0.373 | 0.373 |
| bp_category | 0.006 | 0.112 | 0.037 | 0.135 | 0.678 | 0.723 | 0.112 | 0.086 | 0.080 | 0.122 | 0.063 | 0.020 | 0.030 | 0.014 | 0.373 | 1.000 | 1.000 |
| bp_category_encoded | 0.006 | 0.112 | 0.037 | 0.135 | 0.678 | 0.723 | 0.112 | 0.086 | 0.080 | 0.122 | 0.063 | 0.020 | 0.030 | 0.014 | 0.373 | 1.000 | 1.000 |
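To relate the correlation table above to the "High correlation" alerts at the top of the report, one could filter the coefficients by magnitude. This is only a sketch: the 0.5 cut-off is an assumption that happens to be consistent with the alerts listed (the report itself does not state a threshold), `high_pairs` is a hypothetical helper, and only a handful of coefficients are copied from the table.

```python
# A subset of coefficients copied from the correlation table above.
corr = {
    ("age", "age_years"): 0.999,
    ("weight", "bmi"): 0.848,
    ("ap_hi", "ap_lo"): 0.741,
    ("ap_hi", "bp_category"): 0.678,
    ("ap_lo", "bp_category"): 0.723,
    ("ap_hi", "cardio"): 0.463,
    ("height", "gender"): 0.415,
    ("cholesterol", "gluc"): 0.393,
}

THRESHOLD = 0.5  # assumed cut-off; not stated in the report


def high_pairs(coefficients, threshold):
    """Return (pair, value) tuples with |value| >= threshold, strongest first."""
    flagged = [
        (pair, value)
        for pair, value in coefficients.items()
        if abs(value) >= threshold
    ]
    return sorted(flagged, key=lambda item: abs(item[1]), reverse=True)


for (left, right), value in high_pairs(corr, THRESHOLD):
    print(f"{left} ~ {right}: {value:.3f}")
```

Under this assumed threshold, `ap_hi` and `cardio` (0.463) stay below the cut-off, which matches the absence of a `cardio` alert in the report.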
| | id | age | gender | height | weight | ap_hi | ap_lo | cholesterol | gluc | smoke | alco | active | cardio | age_years | bmi | bp_category | bp_category_encoded |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 18393 | 2 | 168 | 62.0 | 110 | 80 | 1 | 1 | 0 | 0 | 1 | 0 | 50 | 21.967120 | Hypertension Stage 1 | Hypertension Stage 1 |
| 1 | 1 | 20228 | 1 | 156 | 85.0 | 140 | 90 | 3 | 1 | 0 | 0 | 1 | 1 | 55 | 34.927679 | Hypertension Stage 2 | Hypertension Stage 2 |
| 2 | 2 | 18857 | 1 | 165 | 64.0 | 130 | 70 | 3 | 1 | 0 | 0 | 0 | 1 | 51 | 23.507805 | Hypertension Stage 1 | Hypertension Stage 1 |
| 3 | 3 | 17623 | 2 | 169 | 82.0 | 150 | 100 | 1 | 1 | 0 | 0 | 1 | 1 | 48 | 28.710479 | Hypertension Stage 2 | Hypertension Stage 2 |
| 4 | 4 | 17474 | 1 | 156 | 56.0 | 100 | 60 | 1 | 1 | 0 | 0 | 0 | 0 | 47 | 23.011177 | Normal | Normal |
| 5 | 8 | 21914 | 1 | 151 | 67.0 | 120 | 80 | 2 | 2 | 0 | 0 | 0 | 0 | 60 | 29.384676 | Hypertension Stage 1 | Hypertension Stage 1 |
| 6 | 9 | 22113 | 1 | 157 | 93.0 | 130 | 80 | 3 | 1 | 0 | 0 | 1 | 0 | 60 | 37.729725 | Hypertension Stage 1 | Hypertension Stage 1 |
| 7 | 12 | 22584 | 2 | 178 | 95.0 | 130 | 90 | 3 | 3 | 0 | 0 | 1 | 1 | 61 | 29.983588 | Hypertension Stage 1 | Hypertension Stage 1 |
| 8 | 13 | 17668 | 1 | 158 | 71.0 | 110 | 70 | 1 | 1 | 0 | 0 | 1 | 0 | 48 | 28.440955 | Normal | Normal |
| 9 | 14 | 19834 | 1 | 164 | 68.0 | 110 | 60 | 1 | 1 | 0 | 0 | 0 | 0 | 54 | 25.282570 | Normal | Normal |
| | id | age | gender | height | weight | ap_hi | ap_lo | cholesterol | gluc | smoke | alco | active | cardio | age_years | bmi | bp_category | bp_category_encoded |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68195 | 99986 | 15094 | 1 | 168 | 72.0 | 110 | 70 | 1 | 1 | 0 | 0 | 1 | 1 | 41 | 25.510204 | Normal | Normal |
| 68196 | 99988 | 20609 | 1 | 159 | 72.0 | 130 | 90 | 2 | 2 | 0 | 0 | 1 | 0 | 56 | 28.479886 | Hypertension Stage 1 | Hypertension Stage 1 |
| 68197 | 99990 | 18792 | 1 | 161 | 56.0 | 170 | 90 | 1 | 1 | 0 | 0 | 1 | 1 | 51 | 21.604105 | Hypertension Stage 2 | Hypertension Stage 2 |
| 68198 | 99991 | 19699 | 1 | 172 | 70.0 | 130 | 90 | 1 | 1 | 0 | 0 | 1 | 1 | 53 | 23.661439 | Hypertension Stage 1 | Hypertension Stage 1 |
| 68199 | 99992 | 21074 | 1 | 165 | 80.0 | 150 | 80 | 1 | 1 | 0 | 0 | 1 | 1 | 57 | 29.384757 | Hypertension Stage 1 | Hypertension Stage 1 |
| 68200 | 99993 | 19240 | 2 | 168 | 76.0 | 120 | 80 | 1 | 1 | 1 | 0 | 1 | 0 | 52 | 26.927438 | Hypertension Stage 1 | Hypertension Stage 1 |
| 68201 | 99995 | 22601 | 1 | 158 | 126.0 | 140 | 90 | 2 | 2 | 0 | 0 | 1 | 1 | 61 | 50.472681 | Hypertension Stage 2 | Hypertension Stage 2 |
| 68202 | 99996 | 19066 | 2 | 183 | 105.0 | 180 | 90 | 3 | 1 | 0 | 1 | 0 | 1 | 52 | 31.353579 | Hypertension Stage 2 | Hypertension Stage 2 |
| 68203 | 99998 | 22431 | 1 | 163 | 72.0 | 135 | 80 | 1 | 2 | 0 | 0 | 0 | 1 | 61 | 27.099251 | Hypertension Stage 1 | Hypertension Stage 1 |
| 68204 | 99999 | 20540 | 1 | 170 | 72.0 | 120 | 80 | 2 | 1 | 0 | 0 | 1 | 0 | 56 | 24.913495 | Hypertension Stage 1 | Hypertension Stage 1 |
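The near-perfect correlations of `age` with `age_years` and of `weight` with `bmi` suggest that the two columns are derived. The sketch below reproduces them from the sample rows above, assuming `age` is in days, `height` in centimetres, and `weight` in kilograms. These units and both formulas are inferred from the sample values and are not stated in the report; the function names are hypothetical.

```python
def bmi(weight_kg, height_cm):
    """Body mass index: weight in kg divided by squared height in metres."""
    return weight_kg / (height_cm / 100) ** 2


def age_in_years(age_days):
    """Whole years, assuming 365-day years and truncation (inferred)."""
    return age_days // 365


# (age, height, weight, age_years, bmi) copied from the sample rows above.
samples = [
    (18393, 168, 62.0, 50, 21.967120),
    (20228, 156, 85.0, 55, 34.927679),
    (22601, 158, 126.0, 61, 50.472681),
    (19066, 183, 105.0, 52, 31.353579),
]

for age, height, weight, expected_years, expected_bmi in samples:
    # Compare against the reported values within the table's 6-decimal precision.
    assert age_in_years(age) == expected_years
    assert abs(bmi(weight, height) - expected_bmi) < 1e-5
    print(age_in_years(age), round(bmi(weight, height), 6))
```

If these formulas hold for the full dataset, one column from each derived pair carries no extra information, which is what the "High correlation" alerts indicate.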